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IN MEMORY OF GIAN-CARLO ROTA. 



We give new positive and negative results, some conditional, on speeding 
up computational algebraic geometry over the reals: 

1. A new and sharper upper bound on the number of connected components of a 
semi-algebraic set. Our bound is novel in that it is stated in terms of the volumes of 
certain polytopes and, for a large class of inputs, beats the best previous bounds by a 
factor exponential in the number of variables. 

2. A new algorithm for approximating the real roots of certain sparse polynomial sys- 
tems. Two features of our algorithm are (a) arithmetic complexity polylogarithmic in 
the degree of the underlying complex variety (as opposed to the super-linear dependence 
in earlier algorithms) and (b) a simple and efficient generalization to certain univariate 
exponential sums. 

3. Detecting whether a real algebraic surface (given as the common zero set of some 
input straight-line programs) is not smooth can be done in polynomial time within the 
classical Turing model (resp. BSS model over C) only if P = JVP (resp. NP C BPP). 

The last result follows easily from an unpublished observation of Steve Smale. 



1. INTRODUCTION AND MAIN RESULTS 

We provide new speed-ups for some fundamental computations in real 
algebraic geometry. Our techniques are motivated by recent results from 
algebraic geometry but the proofs are almost completely elementary. We 
then conclude with a discussion of how much farther these techniques can 
still be pushed. 
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In particular, we significantly improve the best previous upper bounds 
on the number of connected components of a semi-algebraic[] set, and we 
exhibit a new class of polynomial systems over the real numbers which can 
be solved within polylogarithmic time. As for complexity lower bounds, 
we show that if singularity detection for curves over C can be done in 
polynomial time then, depending on the computational model, we must 
have P = NP or NP C BPP. This can also be thought of as a lower 
bound on the complexity of elimination theory, and immediately implies 
an analogous result on singularity detection for real algebraic surfaces. 



This work is a part of an ongoing program by the author |Roj97, Roj9S 
Roj99l;, Roj99a| to dramatically sharpen current complexity bounds from 



algebraic geometry in terms of more intrinsic geometric invariants. We 
will give precise statements of these results shortly, so let us begin by 
considering the number of connected components of a semi-algebraic set. 

1.1. Sharper Intrinsic Bounds 



The topology of semi-algebraic sets is intimately related to complexity the- 
ory in many ways. For example, the seminal work of Dobkin, Lipton, Steele, 
and Yao |DL79| , ^Y8^ (see also [ pCSS98| , Ch. 16]) relates upper bounds 



on the number of connected components to lower bounds on the algebraic 
circuit complexity of certain problems. More directly, upper bounds on 
connected components are an important ingredient in complexity upper 
bounds for the first order theory of the reals [ BPR96t . 

Our first main theorem significantly improves earlier bounds on the num- 
ber of connected components by O leinik, Petrovsky, Milnor, Thom, and 
Basu [ |OP49| , |Mil64 [rho65 , |Bas96| The main novelty of our new bound 
is its greater sensitivity to the monomial term structure of the input polyno- 
mials. Letting O and respectively denote the origin and the i— standard 
basis vector in R^, X := (xi, . . . ,Xn), and normalizing fc-dimensional vol- 
ume Volfc(-) so that the standard fc-simplex := {a; G K*^ | xi, ■ ■ ■ ,Xn> 
0, J2j ^ 1} has volume 1, our result is the following. 



Main Theorem 1.1. 
Let /i, . . . , fp^s G ^[xi, . . . , x„] and suppose S C 
the following collection of polynomial inequalities: 



is the solution set of 



f^{x)=0, ie{l,...,p} 

^A semi-algebraic set is simply a subset of K" defined by the solutions of a finite 
collection of polynomial inequalities. 

^These papers actually bound the sum of the Betti numbers, which in turn is an 
upper bound on the number of connected components. Our bounds can be extended 
to bound the sum of the Betti numbers as well, but this extension will be addressed in 
future work. 
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fp+i{x) > 0, ie{l,...,s} 

Let Q C M" be the convex hull of the union of {O, ei, . . . , e^} and the set 
of all a with monomial term of some fi . Then S has at 

most 

min{n+l,^-i^}2"s"Vol„(g)(for s>0) or 2"-iVol„(Q)(for s = 0) 
s — 1 

connected components. 



In section H we show that this bound is at least as good as (and frequently 
much better than) the aforementioned earlier bounds. Our bound also 
considerably simplifies, and is competitive with, an earlier polytopal bound 
of Benedetti, Loeser, and Risler |BLR91, Prop. 3.6]. (We note that their 
polytopal bound, in addition to some minor restrictions on the fi, applies 
only when s — Q and p<n.) 

It is interesting to note that there are sharper (even optimal) upper 
bounds relating polytope volumes and connected components for complex 
varieties, beginning with the remarkable work of Bernshtein, Kushnircnko, 
and Khovanski | BKK76 | a bit over twenty years ago. (Se e also [ DK86| .^ 



However, as far as the author is aware. Main Theorem 1.1 presents the first 
nontrivial general upper bounds on the number of connected components of 
semi-algebraic sets with this combinatorial fiavor. The work of Benedetti, 



Loeser, and Risler [BLR91] appears to be the first occurence of polytopal 
bounds for the case where s = and p<n (i.e., certain real algebraic sets). 

Remark 1.1. Finding an optimal upper bound on the number of con- 
nected components of a semi- algebraic set, even in the special case of non- 
degenerate real algebraic sets, remains an open problem. ■ 



Our bound can be further improved in various ways and this is detailed in 
section |[ In particular, we give sharper versions tailored for certain special 
cases (e.g., compact hypersurfaces and real algebraic sets), and we prove 
analogues (for all our bounds) depending only on n, s, and the number 
of monomial terms which appear in at least one fi. Khovanski appears to 
have been the first to consider bounds of this type for the case where s = 
and p<n\ Kho91| . 



1.1 



when com- 



The techniques involved in our proof of Mai n Theor em 
bined with other recent results of the author [ R.oj99b |, also yield similar 



^ We also point out that the classical Bezout's thei 
small class of polynomial systems. So the results of 
as a very special case. 



,um95 is optimal only for a 
KK76| include Bezout's theorem 
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improvements on the complexity of quantifier elimination over real-closed 
fields. This will be pursued in a forthcoming paper of the author. 

1.2. Superfast Real Solving for Certain Fewnomial Systems 

The complexity of solving systems of fewnomials (polynomials with few 
monomial term^ has been addressed only recently. Indeed, the vast ma- 
jority of work in computational algebra has so far been stated only in terms 
of degrees of polynomials, thus ignoring the finer monomial term structure. 
Notable exceptions include JCKSQE | (solving a single univariate fewnomial 
over Z in polynomial time), |Lcn98 (solving a single univariate fewnomial 
over Q in polynomial time), and jRojOS] , |MP98| , |Roj99b| , pLS99| l (solving 
polynomial systems over M or C within time near polynomial in the degree 
of the underlying complex variety). 

While it is more or less intuitively clear what it means to solve a polyno- 
mial system over Z or Q, let us state a motivating problem to clarify what 
we mean by solving over M: 

Problem 1.1. Can one s- approximate all the roots of a univariate fewno- 
mial of degree d, within the interval [0,i?], using significantly less than 
8((iloglog— ) arithmetic steps? ■ 



In particular, an important alternative statement is the following: 

Problem 1.2. Can the complexity of solving fewnomials he sub-linear 
in the degree of the underlying complex variety? ■ 

Finding such "super-fast" algorithms is nontrivial, even for binomials 
(i.e., quickly finding d— roots) Ye94| . The asymptotic complexity limit 
stated in Problem |l.l| , up to a factor polylogarithmic in d, is the best 
current bound for solving a general univariate polynomial of degree d over 
C |NR96|. In particular, the existence of faster algorithms for finding just 
the real roots of a degree d fewnomial was unknown until now. 



Our next main theorem gives an affirmative answer to Problem 1.2, for 
certain fewnomial systems and univariate exponential sums over 



precisely, if f{x) = X^og^ CaX°-, where A C 



More 

is finite and the coefficients 
Co are all real, we call / a (real) exponential fc-sum. When ^ C Z, 
we define the degree of such an / to be m&y.a,a' i^aW — a'}. Otherwise, 
we set deg(/) := max{a — a'}/ min{l, min{a — a'}}, where the second 
minimum ranges over all distinct^ a, a' £ A. We also say that / has 
j sign alternations iff there are j distinct pairs (a, a') G A'^ such that 



^Results on fcwnojuiaJajlsually hold on a much broader class of functions: the so-called 
PfafHan functions Kho91 1 . 

^ We declare the degree of any monomial to be 0. 
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CaCa- < 0, ^ n (a, a') = 0, and a' > a. So, for instance, ATx^-^^ ~ 10.3a:°-^ - 
TT — lOx^'^ — x~^-^ has just one sign alternation but x"^ — 2x + 2 has two. 
Finally, when AcZ, we simply call / a fe-nomial. 

Main Theorem 1.2. Let f be any exponential k-sum of degree d with 
at most one sign alternation. Then, given an oracle for evaluating x^ 
for any a;,rGM, one can e- approximate all the root^ of f in (0, i?) using 
0(fc(log d+log log — )) arithmetic operations overM. (including oracle calls). 
In particular, restricting to k-nomials and removing the oracle, we can still 
do the same using 0(fclogd(log(i + loglog -p)) arithmetic operations over 
M, with d agreeing with the usual degree of a univariate Laurent polynomial. 



We point out that even the trinomial case is difficult. For example, 
while one can count the number of real roots of a trinomial of the form 



X +ax + b within ©(log d) arithmetic operations |Ric93| (regardless of sign 
alternations), doing the same for general trinomials was an open problem 
mitil recently | plY99t . Also, even from a numerical point of view, the use of 
Newton's method is subtle for trinomials: It is known that deciding whether 
a given initial point converges to a root of a;'^ — 2a; + 2 is undecidable in the 
BSS model over R (see pCSS98| , Sec. 2.4] and paF56t ). Nevertheless, this 
need not stop us from finding some good starting point, as we will soon 
see. 

Our algor ithm, aside from an algebraic trick, closely follows an algorithm 
of Ye [Ye94] (for a particular class of analytic functions) which efficiently 
blends binary search and Newton's method. By combining these i deas with 
a few facts on the Smith normal form of an integral matrix |[li89|, we 



can also derive the following complexity result on binomial systems. 

Main Theorem 1.3. Let ci, . . . , c„ gM\{0} and let [dij] be any n x n 



matrix with nonnegative integer entries. Finally, let fi := Xj^'^ 
for all i. Then we can e- approximate all the roots of fi= ■ ■ ■ = /„ 



y,din 



+ Cj 

in 



the orthant wedge {xGl 



.>0,J2t^i<R } within 



0{{n + \ogniax\dij\) ) bit operations. 



followed by 



O log|det[d,. 



log^ (n max | ) + log log 



R 



rational operations over . 



If the above binomial system has only finitely many complex roots, then 
their number is exactly |det[dij]|. This follows easily from Bernshtein's 



.and of course count their number 



6 



J. MAURICE ROJAS 



theorem [ |BKK76| . It is also interesting to note that the fastest previous 
general (sequential) algorithms for polynomial system solving over K or C, 
when applied to binomial systems, run in time polynomial in |det[c?ij] 
[ MP98i |Roj99b| , |GLS99| — that is, super-linear in the degree of the under- 
lying complex variety. 

One can of course solve slightly more general systems of fewnomials by 
threading together the algorithms of Main Theorems 1.2 and 1.3. We 
will say more on the likelihood of farther-reaching extensions of our last 
two results after first discussing a result relating complexity classes and 
singularities. 

Remark 1.2. Finding e- approximations of roots within a suitable re- 
gion is far from the strongest notion of solving a polynomial system. In 
particular, the spacing between roots, which of course dictates the e one 
should choose, must be taken into account. A more com plete and elegant 
framework would be to include the condition number , BCSS98 ] of the 
input fewnomial system in all complexity bounds. It is thus the author's 
intent that the preceding fewnomial complexity bounds be interpreted as a 
first step in this direction. ■ 



1.3. Obstructions to Superfast Degeneracy Detection 

The preceding two algorithmic results circumvent degeneracy problems in 



simple but subtle ways. For instance. Main Theorem 1.2 clearly deals with 



equations having at most one positive real root, while the binomial systems 



of Main Theorem 1.2 are easily seen to have no repeated complex roots (cf. 
section H). Thus, the respective hypotheses of these results (restricting sign 
alternations and/or number of monomial terms) allow us to approximate 
roots without stopping for a singularity check. 

It seems hard to completely solve a system of equations without knowing 
something about its degeneracies, either a priori or during run-time. So let 
us present a result which gives solid evidence that detecting degeneracies 
may be quite difficult. In what follows, unless otherwise mentioned, we 



use the standard sparse encoding for multivariate polynomials |Pla84 



Koi96 1 . Thus the size of a polynomial like x"^ + x — A7 will be 8 (log d) and 



not 6(d), whether in the Turing model or the BSS model over C. 

Main Theorem 1.4. Suppose any of the following problems can be 
solved in polynomial time via a Turing machine (resp. BSS machine over 
C). ThenP = NP (resp. NPCBPP). 

1. Decide if an input polynomial /eZ[a;i] (resp. f£<C[xi]) vanishes at a 
d— root of unity, where d = deg(/). 
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2. Decide if two input polynomials f^gEl^xi] (resp. f , g ^C[xi\) have a 
common root. 

3. Given a nonzero input polynomial f G 'L[xi,X2\ (resp. f G C[xi,X2]j 
decide if the curve {(xi, X2) G (C*)^ | f{xi,X2) — 0} has a singularity. 

4. Given input polynomials f,gCz1j[xi,X2,X3, x^] (resp. /, gGMfxi, 0:2, ^3, X4]), 
in the straight-line program encoding, defining a surface S* C M'*, decide if 

S has a singularity. 

5. Given any finite subset Acil? and a vector of coefficients (ca \ aGyl)G 

i^T-esp. gC'^'^J, decide if the A- discriminant of the hivariate polynomial 
TliaeA ^ax"^ vanishes. 



Remark 1.3. Note that in problem (4) we are already given that S is 
a surface. Determining whether this is true or not turns out to be NP- 
hard (r esp. N Ps_- complete) in the Turing model (resp. BSS model over^) 



lKoi%j. 



For any ^cZ", the ^-discriminant, Va, is defined to be the unique 
(up to sign) irreducible polynomial in Zlca \ a G A] such that /^(a;) 



J2aeA Cas;" has a singularity in its zero set (in (C*)") ^Va = [ pKZ94 |. 
This important operator lies at the heart of sparse elimination theory, 
which is the part of algebraic geometry surrounding this paper. 

The ^-discriminant in fact contains all known multivariate resultants 
and discriminants as special cases, and also appears in residue theory and 



hypergeometric functions [ GKZ94 ] . Thus, a corollary of our last main result 
is that sparse elimination theory, even in low dimensions, might lie beyond 
the reach of P. 

Remark 1.4. It is interesting to note that nontrivial lower bounds on 
the complexity of computing A- discriminants in the one-dimensional case 



Aci'Z, are unknown. However, it is easy to show (via [GKZ94, pg. 274]) 
that one can at least find Da in polynomial time luhen ^cZ" has less 
than n -\- 3 elements. ■ 



We will prove our main theorems in order of appearance, but first let us 
return to our study of semi-algebraic sets to see some examples. 

2. COMPARING UPPER BOUNDS ON THE NUMBER OF 
CONNECTED COMPONENTS 

Here we briefly compare our first main theorem to earlier bounds on the 
number of connected components of a semi-algebraic set. 
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In summary, we can compare our new bound to earlier bounds (stated 
in terms of total degree) in very simple polyhedral terms: Let Aq denote 
the smallest scaled standard n-simplex, 7A„, containing Q. Then, since 
volume is monotonic under containment, our bounds are least favorable 
when (5 = Aq. However, in practice it will frequently be the case that Q 
has much smaller volume that Aq, thus accounting for improvements as 
good as a factor exponential in n. 

2.1. At Least One Inequality 

Assume s > temporarily. Letting d be the maximum of the total degrees 
of the /i, the best prev ious ge neral upper bounds, quoted from [ BCSS98 , 
Ch. 16, Prop. 5] and [ Bas96| respectively, were {sd + l)(2sd + 1)" and 
[p + s)"C'(d)". (The first bound is an improved version of a bound due to 
Milnor, Oleinik, Petovsky, and Thom pP49| , |Mil64| , frho6"5| .) Our bound is 
no worse than min{n+ 1, |3y}(2sc?)" (better than both preceding bounds) 
and is frequently much better. Consider the following examples: 



Example 2.1. (Spikes) Suppose we pick all the fi to have the same 
monomial term structure, and in such a way that Q has small volume but 
great length some chosen direction. In particular, let us assume that the 
only monomial terms occuring in the fi are 1, xi, . . . , Xn-i and {xi ■ ■ ■ a;„), 
(xi • • • Xn)'^, ■ ■ ■, {xi ■ ■ ■ Xn)^ ■ Then it is easy to check that Q is a "long and 
skinny" hypyramid, with one apex at the origin and the other at (D, . . . ,D)g 
M". We then obtain, via two simple determinants, thatYolniQ) = D+l and 
thus our bound reduces to min{n + l, ^^}2^ s"' {D + 1) . However, the afore- 
mentioned older bounds are easily seen to reduce to {nsD + l){2nsD + 1)" 
and {{p + s)0{nD))'' . U 



Example 2.2. (Bounded Multidegree) Suppose now that instead of 
bounding the total degree of the fi, we only require that the degree of fi 
with respect to any Xj be at most d' . It is then easy to check that Q is an 
axes parallel hypercube with side length d' . So our new bound reduces to 
min{n + 1, j^}{2sd')'^ . However, the old bounds are easily seen to reduce 
to {snd' + l){2snd' + 1)" and {{p + s)O(nd'))". ■ 



2.2. Real Algebraic Sets 

Assume now that s — 0. Then the aforementioned earlier upper bounds 
resp ectively reduce to d{2d—l)" and {pO{d))". Specializing Main Theorem 
1.1, we obtain a bound which is no worse than 2"^^(i" (neglibly worse than 
the first, better than the second), and is frequently much better. This can 
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easily be seen by reconsidering our last two examples in the case s = 0. (We 
leave this as an exercise.) 

However, let us now make a fairer comparison to another polytopal bound 
— that of Benedetti, Loeser, and Risler |BLR91, Prop. 3.6]. 



Remark 2.5. The bound [BLRQl, Prop. 3.6] was published with several 
typographical errors. Following inquiries from the author, Francois Loeser 
kindly responded via three e-mails with the following corrections!^ in the 
notation of their bound, a hypothesis ofk<n was missing. Also, in part (a) 
of their statement, the quantity $(A) should be replaced by 9^{A), and the 
last sum should be replaced by the main quantity from Prop. 3.1. Finally, 
in part (c), all j's should be capitalized, and 9 should be replaced by 0^_^j. 



The bo und |BLR91 , Prop. 3.6] has a recursive definition based on mixed 
volumes [GK94, DGH98L For the sake of brevity, we will focus on the four 
examples given in | BLR91 |. 



Example 2.3. (Four Examples from |BLR9lj |) Examples (A), (B), 
(C), and (D) of ^BLROj Sec. 4] concern polynomial systems of the fol- 
lowing shape: (A) co + cix'^ + C2y^ (one polynomial, two variables), (B) 
Co + ciXi^ + • • • + c„a;°" ( one polynomial, n variables), (C) cq + c\x + C2y + 
csixy)"' (one polynomial, two variables), and (D) (cq + cix"" + C2y^,C3 + 
C4X^ + C5j/^ + ce{xy)^) (two polynomials, two variables), where the Ci are 
real constants and a,bGN. 

The polytopal bound of \BLR91 j, when applied to these examples in the 
above order, respectively evaluates to 2ab-\-A, 2ai ■ ■ -an+Lower Order Terms, 
8a, and 2ab — b^ -\- Lower Order Terms. None of the preceding lower order 
terms is stated explicitly in lf}LR9\ j, and it appears that the last value is 
incorrect. However, a closer examination of their (corrected) bound respec- 
tively yields 2ab + 4, 2(ai + 2) • • • (a„ + 2), 8a, and 86^ + 6a6 + 8. 

1.1 is easily seen to respectively evaluate to 2ab, 2"^^ai ■ ■ ■ a 



Main Theorem 
4a, and Aab for these examples 



More gener ally, it is not hard to check that our bound is usually better 
than that of |BLR91 when n is small or p is close to n. (Indeed, the 
bound o f [ BLR91 ] does not cover the case p>n.) However, the bound from 
[ BLR91 ] usually wins when p is a small constant an d n is lar ge. The author 
hopes to combine the techniques here with those of |BLR91| in future work. 



'^Professor Loeser states that th ese eorre r.tions were also checked with Jean-Jacques 
Risler, one of the other authors of |BLR91|. 
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3. PROVING MAIN THEOREM 1.1 



We will first prove a sharper version of Main Theorem 1 . 1 for compact 
hypersurfaces, and then successively generalize to the case of real algebraic 
and semi-algebraic sets. Along the way, we give analogues of our upper 
bounds depending only on n, s, and the number of monomial terms. 

Remark 3.6. Throughout this section, "nonsingular" (or "smooth") 
for a real algebraic variety will mean that the underlying complex va- 
riety is nonsingular in the sense of the usual Jacobian criterion (see, e.g., 
l Mum9^ ). m 



3.1. Point-Free Compact Zero Sets of a Single Polynomial 

We begin with the following important special case of Main Theorem 



1.1. This lemma is also frequently significantly sharper than many earlier 



results and may be of independent interest. 



Lemma 3.1. Following the notation of Main Theorem LJ, suppose p=l, 
s = 0, and S is compact but has no zero- dimensional components. Then S 
has at most ^^.^^^ Vol„(Q') connected components, where Q' is the convex 
hull of the union of {O} and the set of all a with a;" a monomial term of 
h- 

Proof: The main idea will be to show that (for n > 2) the number 
of connected components is bounded above by half the number of critical 
points of a projection of a perturbed version of S. This idea is quite old, 
but wc will introduce an unusual projection which permits a much sharper 
upper bound than before. The case n = 1 of our bound is trivial, so let us 
assume n > 2 henceforth. 

Consider / := /i + ^, for some (5 G R to be selected later. By Sard's 



theorem [ Elir94 1 , there is a set C M of full measure such that (5 G 
5*5 = {x G M" I / = 0} is nonsingular (and a hypersurface) . Also, via a 
simple homotopy argument, S and Sf, are both compact and have the same 
number of connected components, f or |(5| su fficiently small. (Much stronger 
versions of this fact can be found in iBas96| .) Furthermore, note that for all 



but finitely many b, no connected component of S'a lies inside the union of 
the coordinate hyperplanes. We will pick (5 7^6 so that all these conditions, 
and one more to be described below, hold. 

Now consider the function x", with a G Z"\{C)} to be selected later. 
Clearly, any connected component of S (not lying in a hypersurface of the 
form x° = constant) must have at least two special points: one locally 
maximizing, and the other locally minimizing, x". Since there are only 



finitely many connected components (by any earlier bound, e.g., |OP49 



and every component contains a curve, there must therefore be an a G 
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Z"\{C)} so that every component (not lying entirely within the union of 
coordinate hyperplanes) contributes at least two critical points of x"". Pick 
a in this way, subject to the additional minor rcstricition that the g.c.d. of 
the coordinates of a is 1. 

Note that the critical points of the function x"" on Ss are just the solutions 
in R" of 

^ ^ ^ dy2 dy^ 

where the yi are new variables to be described shortly. Our final condition 
on S (which is easily seen to hold for all but finitely many S) will simply 
be that all real solutions to the above polynomial system lie in (R*)" :— 
(R\{0})". Note also that a corollary of all our assumptions so far is that 
the number of complex solutions of (★) is finite. (This follows immediately 
from Sard's theorem, and the fact that the complex solutions of (★) form 
an algebraic set.) 

We are now essentially done: The number of connected components of 
S and Ss are the same, and the latter quantity is bounded above by half 
the number of critical points (on Ss) of the function a;°. This number of 
critical points can be computed in terms of polytope volumes as follows: 
Via the Smith normal form |Smi61 |, we can find an invertible change of 



variables on (R*)" such that yi:=x°' and y2, • ■ ■ , 2/n are monomials in the Xi. 
Furthermore, this change of variables induces the action of a unimodular 
matrix on the exponent vectors of /. In particular, / can be considered as 
a polynomial in R[?/j'^^, . . . ,?/^^] and the number of monomial terms (and 
Newton polytope volume) of / is preserved under this change of variables. 
Thus, up to a monomial change of variables, the critical points of the 
function a:" on Ss are exactly the solutions in (R*)" of (★). 

The key to our new bound is to finish things off by picking a bound 
other th an Bezout's theorem here. In particular, by Bernshtein's theorem 



[BKK7£|, the number of solutions in (C*)" is at most the mixed volume 
of Q' and n — 1 other polytopes wi th tra nslates contained in Q' . By the 
monotonicity of the mixed volume | BZ88|] , the latter quantity is at most 



the mixed volume of n copies of Q' and, by the definition of mixed volume, 
this is just Vol„(Q'). ■ 

We point out that a key ingredient in our proof is that the monomial 
change of variables we use (as opposed to the linear changes of variables 
used in most earlier treatments) preserves sparsity. This allows us to take 
full advantage of more powerful and refined techniques to bound the num- 
ber of real roots, and thus get new bounds on the number of real connected 
components. For example, substituting Bernshtein's theorem for Bezout's 
theorem in the older proofs would not have yielded any significant improve- 
ment. 
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However, we need not have been so heavy-handed and only used tools 
over C. We could have also used the following alternative bound on the 
number of real roots. 



Khovanski's Theore m on Real Fewnomials 3.. 
(Special Case) ^ho91 , Sec. 3.12, Cor. 6] Suppose that for all ... ,n} , 



fi€:M.[xi, . . . ,Xn, rni , . . . , mk] has total degree qi , where the mj are mono- 
mials in X. Let k denote the number of monomial terms which appear 
in at least one o/ /i, . . . ,/„. Assume further that the variety S defined 
by fi, . . . , fn is zero-dimensional and nonsingular. Then S has at most 
(1 + q{)^2^''^^^^/'^ Y\ Qi connected components in the positive orthant. ■ 

We call any set of the form {x € | ± xi, . . . , ±a:„ > 0} a closed 
orthant. When all signs are positive we call the corresponding closed 
orthant the nonnegative orthant. The analogous constructions where all 
inequalities are strict are, respectively, an open orthant and the positive 
orthant. 

As an immediate corollary, our proof above yields the following alterna- 
tive upper bound on the number of components of a smooth compact real 
algebraic hypersurface. 



Corollary 3.1. Following the notation of lemma 3.1, assume further 
that S is a smooth compact hypersurface. Then the number of connected 
components of S is at most 2''^^^{n-\- l)'=+i2'^('^+-'^)/^. In particular, S has 
at most ^{n-\- 1)'=2'^''^'^^"'^-'/^ connected components contained entirely within 
the positive orthant. 

Proof: Following the notation of our last proof, note that multiplying 
any equation of (*) by a monomial in yi , . . . , ?/„ does not affect the roots in 
(R*)". Thus, we can assume (★) has only k + 1 distinct monomial terms. 
Also note that the monomial change of variables x y maps orthants 
onto orthants, and that the case n = 1 is trivial. The first portion of our 
corollary then follows immediately from our last proof (using Khovanski's 
Theorem on Fewnomials with qi — ■ ■ ■ = q-n = 1 instead of Bernshtein's 
Theorem), upon counting roots in all open orthants. The second portion 
follows even more easily, upon observing that we do not need 5 if we only 
want to count critical points in an open orthant. H 

3.2. The Case of Real Algebraic Varieties 

The next step in proving Main Theorem is to increase the number 
of polynomials allowed and drop the compactness hypothesis. Again, the 
following result is frequently much sharper than many earlier bounds and 
may also be of independent interest. 
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Lemma 3.2. Following the notation of Main Theorem suppose now 
that s ~0, so that S is a real algebraic variety, not necessarily smooth or 
compact. Then S has at most 2"^^Vol„((5) connected components. 

Proof: The main trick is to reduce to the case considered by our pre- 
ceding lemma. In particular, define Fs^e :=/i H ^ fp + ^i) — & 

R[xi, . . . ,Xn] and let Ss^e be the set of real zeroes of Fs^e- It then follows 
that for sufficiently small (and suitably restricted) S,£>0, Ss^e is a smooth 
compact hypersurface and the number of connected components of Ss.e is 
no smaller than the number of connected components of S. The proof of 



this fact is standard and a very clear account can be found in [BCSS98, 
Sec. 16.1]. 

In any event, the number of connected components of Sg^e is clearly at 
most iVol„(Conv(2(5' U {2ei, . . . , 2e„})), thanks to our preceding lemma. 
Since the last quantity is just ^ • 2"Vol„((5) we are done. ■ 

We can combine the proof of lemma |3.2| with Khovanski's Theorem on 



Fewnomials to obtain the following generalization of corollary 3.1. This re- 



sult, while giving a slightly looser bound than an earlier result of Khovanski 



[Kho91, Sec. 3.14, Cor. 5], removes all the nondegeneracy assumptions from 



his result. 



Corollary 3.2. Following the notation and assumptions of lemma 3.i 
the number of connected components of S is also bounded above by 
4"~^(2n -I- i)fc+i2'=('=+i)/2. 



Proof: Combining the proofs of lemmata 3.2 and 3.1, and since we are only 
counting roots in (R*)", we sec that the number of connected components 
is at most half the number of solutions in (R* ) " of the following polynomial 
system: 



(**) 



--y2- 



dFs, 
dy2 



9yn 



= 0, 



where Fs^e is the variant of Fs^e where we substitute - yf for X^i^l- (^^ 
is a simple exercise to verify that the proof of lemma 3.2 still goes through 
with this variation.) Now simply note, via the chain rule of calculus, that 
every polynomial in (★*) is of degree at most 2 in , . . . , ?/„ and the set of 
monomials appearing in /i, . . . , /p. Also note that the polynomials in (★*) 
are polynomials in a total of fc + 1 monomial terms. So by Khovanski's 
Theorem on Real Fewnomials, and counting roots in all open orthants, we 
are done. ■ 

3.3. Extending to Semi-Algebraic Sets 

We are now ready to prove Main Theorem |l.l| . 
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Proof of Main Theorem |l.l| : We reduce again, this time to lemma 3^ 



The trick here is to note that every connected component of S is in turn 
a connected component of 5" where S' := G R" | fi{x)— ■ ■ ■ = fp{x) — 
0, fp+i{x) ^ 0, . . . , fp+s{x) 7^ 0}. Every connected component of S' is in 
turn a projection (onto the first n coordinates) of a connected component 
of S"', where S" C R"'^^ is the real zero set of the polynomial system 
(/i, . . . , /p, — 1 + zYYj^ l^^ fi)- This reduction is not new and appears, 
among other places, in pCSS98 , Sec. 16.3]. 



Now lemma 3.2 tells us that the number of connected components of S" 
is at most 2" times the (n + l)-dimensional volume of Conv(Pi U (P2 x 
e„+i)), where Pi (resp. P2) is the union of {O, ei, . . . , e„} and the Newton 
polytopes of /i , . . . , /p (resp. the Minkowski sum of the Newton polytopes 
of /p+i, . . . , /p+s). However, it is a simple exercise to show that P2 C 
P3 where P3 is the union of {O, ei, . . . , e„} and the Newton polytopes of 
/p+i , . . . , fp+s, scaled by a factor of s. Now note that P2 C Q, P3 C sQ and 
Conv(Pi U (P2 X e„+i))CConv(Q U (sQ x e„+i)). 

If s > 1 then the last polytope is in turn contained in a pyramid P with 
apex at (0, . . . , 0, and base Q x e„+i. So we obtain that the number 
of connected components of S is at most 2"Vol„+i(P) = 2"|^Vol„(sQ)= 
f±i2"s«Vol„(g). 

If s = 1 then Conv((5 U {sQ x e„_|_i)) = [0,e„+i] x Q. So, similar to 
the previous case, the number of connected components of S is at most 
2"VoWi(P) = 2"nVol„(Q). 

Now note that the number of connected components of S will always be 
at most min{n + 1, |3j}2"s"Vol„((5), with the possible exception of the 
case (n, s) = (1, 2). So we need only check this final case. However, this is 
almost trivial, separating the cases p>{) and p = {). ■ 



We can give an alternative version of Main Theorem 1.1, solely in terms 
of n, s, and k, as follows. 



Theorem 3.5. Following the notation and assumptions of Main Theo- 
rem the number of connected components of S is also bounded above 



by 4"-2 (s + l)"(2(n + l)(s + 1) + l)fc+i2'=('=+i)/ 



The proof is very similar to that of corollary 3.2, save only that we 
substitute the polynomial system from the proof of Main Theorem 1.1 
into the construction of Fs^e- In particular, we eventually obtain a system 
of n + 1 polynomials of degree 2(s + 1) in a total of fc + 1 monomials, 
thus allowing yet another application of Khovanksi's beautiful theorem on 
fewnomials. 
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4. ALPHA THEORY AND PROVING MAIN THEOREMS 1.2 



AND 1.3 



The proof of Main Theorem 1.2 hinges on gamma theory |BCSS9S| 
which gives useful criteria for when Newton's method converges quadrati- 
cally. In particular, we will need the following elementary analytic lemma 



Lemma 4.1. For any monotonic function <j) : M — > M, let 7^ satisfy 

^^Pk>i \ tj'\x) \'^ - IT- Then, for 4>{x) = x'' , we may take 7^ equal to 
\\r\~\, 2 or 1, according as rG(— 00,— 1) U (1, 00), r G (0, 1), or r G (— 1, 0). 
More generally, if (j) = (j)i +4)2 with (pi and 4>2 both convex and either both 
increasing or both decreasing, then we can take 7^ = max{70j , 7^2 } . ■ 

The first part is a simple exercise while the second part is a proposition 



from |Ye94[| 



We are now ready to sketc h th e proof of Main Theorem 1.2, 
Proof of Main Theorem [l.2| : We begin by changing our function / 
slightly. First let M be largest exponent occuring in the fc-sum / and let 
m be the smallest real number so that a;'" is a monomial term of / with 
positive coefficient. (We assume, by multiplying by —1 if necessary, that 
the leading coefficient of / is positive.) By dividing out by we may 
assume that to = 0. Via the change of variables x = y^^^^ , we may further 
assume that M = 1. In particular, we now obtain that / is a sum of two 
increasing convex functions: one a positive linear combination of powers 
of X (with exponents in (0, 1]), the other a negative linear combination of 
powers of x (with exponents in (—00, 0)). 

By our preceding lemma, we may take "if = d (the degree of /) since d is 
no smaller than the degree of our original /. We now invoke the hybrid algo- 



rithm from I Ye94 , Theorem 3] : This algorithm allows us to e-approximate 
the real roots of / in (0, R) using ©(log 7/ -I- log log -^) = ©(log d -I- log log 
function evaluations and arithmetic operations. To conclude the first part 
of this main theorem, inverting the change of variables we made requires 
another O{\ogd + log log ■^) operations via the same algorithm (since tak- 
ing n— roots is the same as solving an exponential 2-sum). However, we 
may have decreased the accuracy of our e-approximation. So we just begin 
by solving to accuracy min{£*^~™, e} instead to obtain the first part of our 
main theorem. (Note also that evaluating / requires k uses of our oracle.) 

To obtain the second part of our theorem, we simply use the same al- 
gorithm without the oracle. This simply introduces another factor of \ogd 
since monomials can now be evaluated by the usual repeated squaring trick. 



Main Theorem 1.3 only needs a special case of Main Theorem 1.2. In 



fact, |Ye94| contains a slightly modified algorithm for the binomial case 
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with an even better complexity bound of ©(log d log log ■^), which we will 
use below. However, we will also require some refined quantitative facts 
about the Smith normal form of a matrix. 



Lemma 4.2. , IliSi / Let A — [a.y] be any nxn matrix with entries only in 



Z and define Ha to be log(27i + max ja^ |). Then, within O* {{n + Ha)^'^'^^) 
bit operations, one can find matrices U , D, V with the following properties: 

1. U and V both have determinant ±1 and entries only in TL. 

2. D is diagonal and has entries only in Z. 

3. UAV = D 

4-det A is the product of the diagonal elements of D and hu, hy = 0{n^{hA- 
logn)2). 



Proof of Main Theorem |1.3| : We begin by immediately applying the 
Smith normal form to our matrix [dij]. (This accounts for the bit operation 
count.) Clearly then, we have reduced to the case of n binomials of the 
form — 7i , . . . , xjj" — 7„ . The real roots of this polynomial system can 



then be e-approximated by n applications of Main Theorem 1.2. Since 
J2i log di — log Hi = I det[dy] I, this accounts for almost all of the second 
bound. 

To conclude, note that we must still invert our change of variables. By 



lemma 4.2, computing this monomial map is almost the final contribution 
to our second complexity bound. The only missing part is the fact that we 
may have needed more accuracy at the beginning of our algorithm. Lemma 



4.2 also tells us how much more accuracy we need, thus finally accounting 



for all of our second complexity bound. I 

5. SMALE'S THEOREM AND MAIN THEOREM 

We begin with the following result of Plaisted. 



Plaisted's Theorem 5.. ; PlaS^J Deciding if an input polynomial f G 



I^lxi] coefficients) vanishes at a d— root of unity, where d = deg{f), is 
NP-hard. ■ 

In the above (and in what follows) / is given in the sparse encoding, so 
coefficients and exponents are measured by bit-length. 

The following unpublished result of Steve Smale gives an intriguing ex- 
tension of Plaisted's result via computations over new rings. 

Smale's Theorem 5.. Suppose we can decide, within polynomial time 
relative to the BSS model over <C, if an input polynomial f £C[xi] vanishes 
at a d— root of unity, where d = deg{f). Then NPCBPP. 
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Proof: Given any complexity class C over the Turing model, consider its 
extension Cc to the BSS model over C. It is th en a simple fact that C is 
contained in the Boolean part of Cc, BP{Cc) jCKKLWOSj . However 



we 

BPP 



will make use of an inclusion going the opposite way: BP[Cc) C C 
[]CKKLW95 1 . Applying this to the problem at hand, we thus see that the 



hypothesis of Smale's theorem, thanks to Plaisted's Theorem, implies that 
NP(ZBP{Pc) = P^^^ = BPP. So we are done. ■ 

Our final main theorem then follows from some simple reductions to 
problem (1) from the statement. 

Proof of Main Theorem [l.4| : First note that the assertion concern- 
ing problem (1) follows immediately from Smale's Theorem and Plaisted's 
Theorem. It thus suffices to successively reduce (1) to special cases of all 
the other problems. 

The assertion for (2) is then clear, since via the special case g{x) = 
x'^ — 1, any polynomial time algorithm for (2) would give a polynomial 
time algorithm for (1). 

On the other hand, a polynomial time algorithm for problem (5) would 
imply a polynomial time algorithm for problem (2). This is because prob- 
lem (2) is essentia lly the d ecision problem of whether the s parse r esul- 
tant of / and g |1GKZ94| is zero. Via the Cayley trick 1GKZ94| , the 



^-discriminant for A = P U {Q x 62) (where P and Q are respectively the 
support^ of / and g) is exactly the sparse resultant of / and g. So this 
portion is done. 

Note also that (3) is just a reformulation of (5). 



As for (4), via the Jacobian criterion for singularities [ Mum95 applied 
to the real and imaginary parts of the input to (3), a polynomial time 
algorithm for (4) (using the straight-line program encoding for the input) 
would immediately imply a polynomial time algorithm for (3) (using the 
straight-line program encoding for the input). Such an algorithm would 
then immediately be a polynomial time algorithm for (3) with inputs given 
in the sparse encoding. ■ 
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